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A METHOD FOR PERFORMING MARKET SEGMENTATION AND 
FOR PREDICTING CONSUMER DEMAND 

FIELD OF THE INVENTION 

5 The present invention relates generally to methods 

for performing market segmentation and to methods predicting 
consumer demand. More specifically, the present invention 
performs market segmentation by determining dissimilarity 
measures and predicts consumer demand by constructing a 

10 landscape of consumer demand through focused sampling. 

BACKGROUND 

The problem of determining what combination of 
factors in a given product (from toothpastes to cookies or 

!5 cars) will attract customers is a difficult one because the 
relationship between che factors may be highly nonlinear and 
because the ratings of factors by cuscomers may be biased for 
various reasons (formulation of the questions, rating scale, 
customer answers do not reflect accual preferences, etc.). 

20 For marketing purposes, a company would like to be able to 
find clusters composed of a sufficient number of customers 
wiwh similar preferences in order to either launch a new 
produce or adjust an existing product, adapted to such 
preferences. Of critical importance is that customers with 

25 similar sets of preferences be assigned to the same cluster 
and that customers with significantly different sets of 
preferences be assigned to different clusters. 

Clustering algorithms exist that can generate 
clusters that satisfy either one or both of these 

30 constraints. Multidimensional scaling methods go one step 
further to allow visualization of high- dimensional data 
clusters in a low-dimensional embedding apace. But 
clustering algorithms and multidimensional scaling methods 
always assume the existence of a well-defined metric or 

35 dissimilarity measure in attribute space, here the space of 
factors that contribute to a product. 
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Accordingly, there exists a need for a method for 
partitioning that provides both a relevant metric and a set 
of clusters. 

Next, a large body of mathematical and statistical 

5 work exiats concerning means to estimate the optimal 

composition of a good or service for a given customer, or 
population of customers. This body of work contains 
techniques, known in the art, such as CART, and discrete 
choice providing means for determining utility 'functions over 

10 a space of properties of a good or service for a given 
consumer, as well as means of considering a population of 
different customers with different preferences over that 
space of properties and attempting to "segment" the customer 
population into subgroups which may then be specifically 

15 targeted by marketing, or shifting the property mix of each 
product produced and the vector of products to "match 11 
optimally the customer population. Typically the aim is to 
maximize profit for the firm. 

The means in the art, in general, attempt to fit 

20 the observed data points by building up sketches of the 
utility surface for a given consumer or class of consumers 
using, in the simplest case, linear regression of the data 
points on all the property axes. Different classes of 
consumers are discriminated by discovering different linear 

25 regression patterns for different, e.g., demographic classes. 
In more sophisticated approaches, attempts are made to model 
the possibly curved properties of *isoultility M surf aces in 
the space of properties for a given consumer, or class of 
consumers, by fitting higher order polynomials to the sampled 

30 data. The generic problem with this approach is that the data 
sampled must be used zo estimate the coefficients of the 
monomial and polynomial terms, and the finite amount of data 
is typically used to characterize the lowest order terms, 
monomial, quadratic, etc, first. The data is typically "used 

35 up in obtaining reliable statistical estimates of these low 
order terms, and little or none is left over for use 
estimating higher order terms. 
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On the other hand, the higher order terms are 
precisely the measures of the complex context depend nt 
interactions among properties of a single good or service or 
a collection of goods or services. A trivial example is 
5 breakfast, consisting of ham and eggs. These two are 

traditionally called "consumption complements" by economists. 
The utility of ham for many consumers is much higher in the 
presence of eggs, than alone,- so too with egge. Another case 
is niche marketing of cellular telephones. Not only are these 
i0 phones of interest. to high volume users with expense 

accounts, but to low volume users who happen to be women with 
small children driving in rural areas and worried about an 
accident and no means of calling for help. The context 
dependence of the properties: woman, mother, with child in 
x5 car, accident and safety demonstrates the combinatorial 
character of one niche occupied by this product. 

On a simpler level, a given product, say soap, 
might be characterized by a number of features: color, shape, 
smell, saponin content, mass, etc. Or coca-cola packaging may 
20 be characterized by a number of properties, number of cans, 
size of cans, fluid in can, color of package, etc. 

Accordingly, there is also a need for a method for 
determining consumer demand that finds the context dependent, 
or combinatorial optimized set of properties, uses, or 
25 customer features that optimize the value of a product to the 
customer base. 



SUMMARY OF THE INVENTION 
The present invention presents a method for 
partitioning that provides both a relevant metric and a set 
of clusters through an evolutionary learning process. 

It is an aspect of the present invention to present 
35 a method for partitioning a space of data comprising the 
steps of: 

choosing a plurality of dissimilarity measures; 
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partitioning the space for each of said plurality 
of dissimilarity measures; 

evaluating said partitioning for each of said 
plurality of dissimilarity measures; and 
5 selecting one or more of said dissimilarity 

measures on the basis of said evaluation. 

The present invention further presents a method for 
determining consumer demand that finds the context dependent, 
or combinatorial optimized set of properties, uses, or 
10 customer features that optimize the value of a product to the 

customer base. 

It is an aspect of the present invention to present 
a method for determining customer demand for products 
comprising the steps of: 

defining a space having R dimensions wherein each 
point in said space corresponds to a vector of properties; 

constructing a landscape for said space comprising 
the steps of: 

locating at least one point on said space 
20 where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price/ and 

sampling a set of points on an R- dimensional 
sphere surrounding said selected point at a predetermined 
25 step length from said selected point to determine a first 
subset of said set of points where the predetermined customer 
would make a purchase at said predetermined price and to 
determine a second subset of said sampled points where the 
customers would not make the purchase at said predetermined 
30 price, said first subset'of points and said second subset of 
points form at least one indifference surface between a 
buying region and a non-buying region at said predetermined 
price . 

The present invention further includes a framework 
35 for the marketing and introduction of novel products. The 
framework has means to model customers and to derive an 
optimal set of goods to produce alone or in the face of a 
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revolving co«P titiv. environment where other firm* are 
<„trodudng end modifying their own goods. 

It is . further eepect o£ the present invention to 
pr ..ent a method for creating a *»del of confer preference 

evaluating eaid plurality of candidate map. and 
10 „id family of egent-heeed model, with reepeot to eaid 
cceumer dec., or mte o£ s . id plu „ uty o£ 

cendidate maps »d said family of agent ba.ed model, beeed on 

said evaluation; and ^ lo ^. d 
performing at least one operation on sa.d selected 

15 A-*»r~ maos and said selected agent-based models to 

candidate maps ana family 
generate a new plurality of candidate maps and a new amily 

of agent -based models. 

BRIEF DESCRIPTION OP THE DRAWINGS 
20 FIG 1 provides a flow diagram of the adaptive 

dissimilarity partitioning method 100 of the present 
invention.^ ^ ^ & flow diagram of a method for 

25 determining consumer demand 200 that finds t>""^* 
" deoendent, or combinatorial optimized set of P^ tie9 « 
uses, or customer features that optimize the value of a 

prcduct to *■ --;-7 flow diagyam of the frarae wor k 300 

, , the marketing end Introduction of novel produoce. 
» £ " r ^ 4 provides . flow diegrem of the method for 

*n»n a model of customer preferences, 
creating a^odel & repre8entative computer system 

in conjunction with which the embodiments of the present 
35 invention may be implemented. 
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DETAILED DESCRIPTION OP THE PREFERRED EMBODIMENT 
The present invention presents methods for 
partitioning that provide both a relevant metric and a set of 
clusters through an evolutionary learning process called an 
_ ada p t ive dissimilarity partitioning methcd. Without 
5 notation, many of the following embodiments of the adapts 
dissimilarity partitioning method are explained in the 
illustrative context of market segmentation. However, it 
wiU "e apparent to persons of ordinary skill in the art that 
no the aspects of the embodiments of the invention are also 
applicable in any context in which the natural metric or 
dissimilarity measure of attribute space is not precisely 



Let us define a set of n m-dimensional data vectors 

x , n). The components (j-1 "0 °e «al 

" variables, binary variables, or other types of variables^ 
T»e aim of a typical clustering algorithm is to assign the 
data points to clusters to minimize some cost function. A 
pr otyoe vector is usually associated with each cluster: a 
,« clustef is defined as the set of data vectors that are closer 
20 the cluster's prototype than to any other F"*otyp.. 
example, in the k-means clustering algorithm one has to 
determine the coordinates of k prototype vec ors y> (h-1, 
.... k) to minimize the following cost function: 

25 

30 .here V l li *• "«^n.d «> cluster h and ^ - 0 
otherwise, end \ \ i» a dist.no. i» the speoe of o.« 
vectors. The K-««- clustering slgorlthn is 
S „»e methods for cUe.Uic.tion .nd snalysl. of «ultl vsri.te 
35 observations, McOueen. a.. Proa, Fifth Berkley m,. on 
" Katies! Statistic, ahd Probability, Vol. 1 W Cam, L. 
„. . Heyn,an. ede,. university of California Pre... 
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Berkeley, CA, 1965, pp. 2B1-297. An acceptable clustering 
solution is given by {m 4h }, where each data vector is assigned 
to one and only one cluster. In the k-means algorithm, the 
cluster prototypes are initialized with the first k data 
5 vectors. A new data vector x 4 , i>k, is assigned to the 

closest prototype vector y M1) . The prototype is adjusted in 
response to x w or, more precisely, is moved closer to x iS 

t 

10 



15 

The total adjustment of the prototype is normalized to the 
number of vectors that have already been assigned to that 
prototype- A randomized version of this algorithm, 
supplemented with topological constraints on prototypes, is 

20 the self -organizing map. an unsupervised neural network. 
Unsupervised neural networks are explained in The self- 
Organizing Map, Kohonen, T., 1990, the contents of which are 
herein incorporated by reference. We have assumed the 
existence of a well-defined distance \.\ . Sometimes, only 

25 pairwise (or higher-order) relationships among vector 

components are available. In such cases, the cost function to 
be minimized is the product of the dissimilarities of data 
vectors assigned to the same cluster. 

Multidimensional scaling (MDS) is used to represent 

30 data points in a two-or three-dimensional Euclidian space 
such that pairwise distances in representation space closely 
match pairwise dissimilarities as explained in 
Multidimensional Scaling, Cox, T. P. & Cox, M. A. A., Chapman 
& Hall, London, 1994 ( "Multidimensional Scaling 4 '), the 

35 contents of which are herein incorporated by reference. A 
clustering algorithm can be applied to the representation 
Vectors. Let y> be the vector that represents data vector x 4 . 
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Let d iu be the distance between two representation vectors y A 
and /a" K = |h"^fc &nd D " the dissimilarity between x, and 
The cost function (also called stress) to be minimized 
is typically given by: 

E=SIw lu (d iu -D iu ) 2 , 

M D-l 



10 

where the weights w lu are introduced to normalize the 
absolute values of the disparities D lu . A common choice for w lu 
is 



15 



1 



a-l 0-1 



20 Other definitions of stress and algorithms for minimizing 
stress are surveyed by Multidimensional Scaling. 

In both clustering and MDS, the initial 
dissimilarity measure is assumed to be known. Given the 
dissimilarity, a clustering algorithm provides clusters 
25 whereas MDS provides a low-dimensional representation. The 
obtained clusters or representations critically depend on the 
choice of the dissimilarity measure. Such a measure is 
usually defined on the basis of "intuitive"" criteria and 
relies on the "expertise" of the designer. Defining a 
30 dissimilarity measure, however, can in principle be 
automated. Clustering or scaling data, although it is 
sometimes used for exploratory data analysis, is usually a 
first "preprocessing 11 step in a particular task to be 
performed (compression, understanding, market segmentation, 
5 etc.) . The performance of clustering or MDS can therefore be 
8 measured not only with respect to the cost function or stress 
to be minimized but also in connection with the task to be 
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performed. The appropriate dissimilarity measure can be 
learned in a supervised manner on a training set, tested on a 
validation set, and applied to new data. The proposed 
learning algorithm is a genetic algorithm. Genetic 
5 algorithms are described in Genetic Algorithms in Search, 
Optimization and Machine Learning, Goldberg, D. E., Addison- 
Wesley, Reading, MA. , 1989 (Genetic Algorithms in Search, 
Optimization and Machine Learning) , the contents of which are 
herein incorporated by reference. 

FIG. i provides a flow diagram of the adaptive 
dissimilarity partitioning method 100 of the present 
invention. In step 102, the method 100 chooses a family of 
of distance functions or dissimilarity measures. In step 
104, the method 100 randomly generates a population of 

13 dissimilarity meaeuresD ¥ = {d^} or distance functions d v in 

the chosen family, where v is the index of a given 
dissimilarity measure in that population. Each -individual" 
v is encoded into a "genotype" . 

2Q In s tep 106, the method 100 performs clustering or 

multidimensional scaling with a given algorithm for each 
distance function or dissimilarity measure. In step 108, the 
method 100 evaluates the performance of clustering or 
multidimensional scaling and assigns fitness to every 

25 dissimilarity measure v. In step 110, the method 100 selects 
individuals on the basis of fitness. In step 112, the method 
100 applies operators to selected individuals and pairs of 
individuals. Preferably, the operations are genetic 
operators such as mutation and crossover. 

3Q * In 8t ep 114, the method 100 determines whether the 

partitioning results are satisfactory with respect to the 
fitness computed in step 108. If the partitioning results 
are not satisfactory, control returns to step 106 to perform 
clustering or multidimensional scaling for each new distance 

, 3 functions or dissimilarity measure created in steps 110 and 

* 3 112. If the partitioning results are satisfactory, control 
proceeds to step 116 where the method 100 terminates. 



v. 
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The distance function cr dissimilarity measure can 
be represented by a true function of the vectors' coordinatea 
or by a set of pairwiae relationahipa. When only pairwise 
relationships between data vectors are available one needs to 
5 generalize the dissimilarity measure to data vectors which 
have not been presented. The simplest generalization 
procedure is to use a locally linear interpolation, using the 
k nearest neighbors: the dissimilarity between the new vector 
V and any other vector 0 is given by the average 
10 dissimilarity between the k nearest neighbors of V and 0. 

The following examples illustrates the operation of 
the adaptive dissimilarity partitioning method 100. Let us 
assume for def initeness that each data vector x, is two- 
dimensional. The two components of x» represent two properties 
15 of a cookie, fcr example, sweetness and chewiyness. A set of 
n customers is asked to determine the respective levels of 
sweetness and chewyness they like in a cookie, on a scale of 
1 to 10 for each property. In addition, each customer is 
asked to tell which type of cookie he or she is currently 
20 using. Assume that k different types of cookies are 

represented. The distance function in the space of customer 
preferences is unknown. For example, one factor may be more 
important than another. A simple family of distance functions 



is: 

25 



1/2 



where f» and f, are, for example, second-degree polynomial 
functions of their variables. Each function is characterized 
by 15 parameters, the coefficients of the polynomials. The 
variations of these parameters is assumed to be restricted to 
[-10,10]. A clustering algorithm, such as k-means, is 
applied to the data set using this distance function. The 
fitness of a distance d T is given by 
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pv _ 

" 1 + M tn + M w * 

where M 4o is the number of customers assigned to the same 

5 cluster that do no buy the same cookie type and i9 the 
number of customers assigned to different clusters that buy 
the same cookie type. Depending on the task at hand, these 
two types of mismatch can be given different weights. 

The best individuals obtained after, say, 1000 

10 generations of the genetic algorithm, correspond to distance 
functions that allow to obtain the right clusters of 
customers in the sense described above. 

The adaptive dissimilarity partitioning method 100 
of the present invention finds the natural dissimilarity 

15 measure or distance function in a space of attributes. This 
function may be unknown. Instead of resorting to ad hoc 
functions, the method systematically generates a distance 
function adapted to the cask at hand. The obtained distance 
function reflects the true structure of the space of 

2c attributes and therefore can be used, in the context of 
market segmentation, to cluster customers, extract the 
"natural" clusters in the data using a non parametric 
clustering algorithm (that is, one in which in the number of 
clusters is not predefined) , extract the effective dimension 

25 of the space of preferences, test product differentiation, 
improve positioning by product adjustment, and test potential 
new products, taking into account the cost of moving from one 
product to another or of launching a new product. 

Other significant areas of application include 

30 protein data visualization, protein function and structure 
prediction, dimensionality reduction for virus data sets, 
general classification and pattern recognition problems, and 
data visualization, including database visualization and 
navigation. 

35 in another example, two hundred two-dimensional 

data vectors were randomly generated. Let x-^ and x tJ be the 
x- and y-coordinates of the ith data vector. x lt and x„ are 
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drawn from a uniform random distribution on 10,1] . Let us 
assume that x u and x la represent customer pr ferences for two 
selected features of a given product type, that two products 
are on the market, and that a customer i purchases product 1 
s if and only if x u <0.5 and purchases product 2 if and only if 
x n so. 5. in this example, therefore, only x u is relevant in 
the determination of what product is purchased by a customer 
whose preference vector is (x iw x la ) . But this information is 
not known to the analyst, who simply assumes that the 
relevant distance in preference space is, for example, the 
Euclidian distance. Using such a distance, the analyst will 
be unable to correctly segregate customers into two classes. 
What the algorithm has to find is the relevant distance in 
preference space that will naturally lead to the correct 
1S segregation after application of a simple clustering 
algorithm. Here we use a modified version of the k-means 
clustering algorithm with k=2. Two centroids are initially 
located at (0.5, 0.25) and (0.5, 0.75). Ideally, after 
application of the clustering algorithm with the appropriate 
20 distance function, the centroids should converge to (0.25, 
0 5) and (0.75, 0.5). Remember thaz with this clustering 
algorithm a data vector belongs to the cluster whose centroid 
is closest to that data vector. Let C, (i , be the centroid 
closest to vector x t ( Q,, » ArgMir{d(C mi X,)) , where d is the 

25 distance function), and C tisj the jth coordinate (j-1,2) of 
C nU , . The centroid update upon presentation of x, is given 
by: 



c^c^n — - — <r(H~ c m>> 



where d is the current distance function, o() is the sign 
35 function (o(u)-l if u>0, o(u)-l if u<0, and o=0 if u-0) , r 
is a learning rate, and n«20C is the number of data vectors. 
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The family of distance function U9ed in this example has 
three parameters: 



d{x„x h )* [w|x fl -x yl | <, +(2- w)\x n = xj'] 



2_ 
a+fi 



where w, a, and p e[0,2]. When w-1 and on (J«2, the usual 
Euclidian distance is recovered, and when w«=l and a«|J»l one 
10 9 ets the city-block (or Lj distance. 

This family of distance functions can easily be 
generalized to higher-dimensional spaces. For example, let 
us consider a D-dimensional space* 



15 



£ ™p x * - x j*>\ J*- 1 ' * 



20 with 



X>,=A 

25 

where a ? (p«l,.. • /&) and w p <p«l, . .,D) are 2D parameters (of 
which only 2D-1 are free parameters) that determine the 
relative importance of the pth coordinate and the amount of 
distortion along the pth coordinate. This family of 

30 functions assumes no correlation among coordinates, which is 
certainly a limitation in certain cases. Other distance 
functions should be used in such cases. 

For the simple two-dimensional example, a simple, 
fitness -proportionate (+elitism) genetic algorithm (GA) was 

35 used with the following fitness function for distance d v : 
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1 



5 

where K 1B is the number cf customers assigned to the same 
cluster, that do no purchase the same cookie type and NW is 
the number of customers assigned to different clusters 'that 
buy the same product. The population size was 40, the 
10 mutation rate 0.1, and crossover was replaced with averaging 
of oarametere (that is, two selected individuals produce one 
offspring the parameters of which are the arithmetic average 
of its parents' parameters) . After 10 generations, the GA 
finds values of the parameters that consistently produce a 
15 perfect clustering of customers after application of the 
modified Ic-means algorithm. During one application (200 
iterations) of the k-means algorithm for "bad" values of the 
parameters (w-0.96, a-1.81, P-l.77), close to the Euclidian 
distance, the centroids are unable to move to the optimal 
20 locations and remain confined in the vicinity of their 

initial values. For "good" values of the parameters found by 
the GA after 10 generations (w-1.98, a-1.67, 0=0.03), the 
centroids move to the optimal locations because the distance 
function assigns almost all the weight to the x-coordinate. 
25 The GA has therefore been able to find a distance function, 
within the family of distance functions, that reflects the 
"true" structure of preference space. 

Assume now that instead of being uniformly 
distributed in [0>H x [0,1] customers form four clusters 
30 (with the same "purchase- rule: customer i purchases product 
1 if and only if x 4i <0.5 and purchases product 2 if and only 
if x t - *0.5). Two situations can occur: the four clusters may 
discriminate along the y-axis or along the x-axis. Upon 
application of a non-parametric (number of clusters 
35 undefined) clustering or multidimensional scaling algorithm, 
the situation where the four clusters may discriminate along 
the y-axis should lead to the detection of 2 clusters while 
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the situation where the four clusters discriminate along the 
x-axis should lead to the discovery of 4 clusters if the 
appropriate distance function is used. If the Euclidian 
distance function is used both situations lead to the 

5 detection of 4 clusters. A non-parametric (ant -based) 
algorithm leads to 4 clusters in both cases using the 
Euclidian distance. The same algorithm leads to 2 clusters 
when applied to the situation where the four clusters 
discriminate along the y-axis and 4 clusters in the situation 

10 where the four clusters discriminate along the x-axis. 

In an alternate embodiment, for more complicated 
problems, general function approximators such as neural 
networks are used instead of family of distance functions. In 
the case of neural networks connections weights are evolved 

15 using the genetic algorithm, 

In another alternate embodiment, GA is interactive: 
the outcome of the clustering or MDS algorithm is evaluated 
by a human observer who picks the good solutions. 

The adaptive dissimilarity method 100 is also 

20 applicable to graph partitioning. Let G=(V,E) be a non- 
directed graph. V= (vjt.i „ is the set of n vertices and E, 

a subset of VxV, the set of edges, of cardinal |e|. E can be 
represented as a matrix [e^J of edge weights, e £j being the 
weight of edge (v 1# v 5 ) , where e l3 *0 if (v^Vj) e E and e 4j «0 

25 if (v t ,Vj) e E. The bipartitioning problem consists of 

finding 2 sets of n/2 vertices each such that the total edge 
weight between clusters is mimimal. This problem is known to 
be NP-complete, and many heuristics have been proposed to 
find reasonably good solutions in polynomial time. The 

30 question we may ask is the following: is there a natural 
distance in connection space (where the coordinate of a 
vertex v A is given by e 1Jf j n) such that the 
application of the k-means clustering algorithm (k=2) 
generates a good solution of the bipartitioning problem? 

35 The adaptive dissimilarity partitioning method. 100 

has been tested on random graphs G(n, c, p t , P t ) , where n»100 
is the total number of vertices, c=2 is the number of 
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clusters, p x is the probability that two vertices within a 
cluster are connected, and P. is the probability that two 
vertices belonging to different clusters are connected. 
Edges are characterized by e tj =l if (v^v,) e E and e^-O if 

5 (v.,^) e E. Such graphs are convenient to test the algorithm 
because the optimal solution of bipartitioning is known when 
c-2: the optimal partitioning solution consists of having as 
many vertices as possible that belong to the same graph 
cluster allocated to the same partition cluster. These graphs 

1C have been introduced and are used as difficult benchmark 
problems in the context of VLSI design. 

A modified version of the k-means algorithm is 
applied. The two eentroids are initially assigned random 
coordinates. Let C a(ii be the centroid closest to vertex v t 

15 ( C/1 , = ArgMin (d(C K , v t ) )) , where d is the currant distance 



function), and C K(1): the jth coordinate (j-1, n) of C, 
The centroid update upon presentation of Xi is given by: 



20 



c tm <- c nfi;j + I — - — - c. hK ), 



where d is the current distance function, o() is the sign 
function (o{u)-+l if u>0, o(u)— 1 if u<0, and o-O if u=0) , n 
is a learning rate, and n-200 is the number of data vectors. 
The family of distance function used in this example ha6 
three parameters: 



35 
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10 



where w e [0,1], and a and P e [0.2]. When w-1, one gets 
usual distances. The first term contains only zeroth and 
first-order relationships between the two vertices: this 

15 term is small when the two vertices are connected (Oth-order) 
and are connected to the same set of vertices (first-order) . 
The second term, which gets activated when w<l, represents 
second-order relationships between two vertices: this term is 
small when the neighbors of the two vertices have a lot of 

20 adjacent vertices in common. Such relationships ma/ be 
important for graph partitioning, but the extent to which 
they improve the partitioning is not known. 

The fitness function used in the GA for distance d v 

is given by: 

25 

1 

f = 



1 + E tat + 

30 



il 



where E lnt is the total inter-cluster weight, and n, is the 
number of vertices assigned to cluster 1. The |«,-2|term is 
there to favor well-balanced solutions. 

35 

The present invention further presents a method for 
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determining consumer demand that finds the context dependent, 
or combinatorial optimized set of properties, uses, or 
customer features that optimize the value of a product to the 

customer base. 

Previous work has developed a general model o£ 
5 rugged fitness landscapes called the NK model as explained in 
The Origin* of Order, Stuart A. Kauffman, Oxford University 
Press 1993. Chapter 2, the contents of which are herein 
incorporated by reference. The NK model is also explained in 
10 At Home in the Universe, Stuart A. Kauffman, Oxford 

University Press, 1995, Chapter 9, the contents of which are 
herein incorporated by reference. 

NK landscapes are members of a still more general 
class of models in physics, and known in the art as P spin 
1S models. A P spin model consists of N spins, each of which 
can take on a discrete number of values, say -1 and *l. 
and 0, or a,b,c,d. Each spin contributes an "energy" to the 
total energy of a system of N spins. The energy of a given 
so'n configuration of the N spins is given by the sum of the 
20 energies of the N spins. Bach spin's energy contribution is, 
in general, given by a sum of a monomial term which is a 
function of its own state, plus quadratic terms which are 
sums of energies chat are functions of the staces of all 
spins that influence it in pairwise interactions, plus a 
25 similar sum of cubic terms listing all the contributions of 
all t-iples of spins of which that spin is a member, plus 
higher order terms. In the NK model, K is the highest order 

coupling. . 

in such spin-glass models, the discrete system has 
30 a rugged "fitness- "cost" "efficiency" or "utility" landscape 
ove- the combinations of states of the N spins. New 
techniques have been developed to characterize a number of 
features of such landscapes. And it is these features that 
allow ready assessment of the importance of higher order, 
combinatorial properties on landscape structure. These 
properties include: 1) The number of peaks in the landscape, 
2) The expected number of steps to a peak from any given 
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point in the landscape. 3) The dwindling number of 
direction* "uphill" as the peek is climbed. 4) The number 
of different peeks that can be climbed from a single point on 
the landscape by adaptive walks which must proceed only 
s uphill. 5) The correlation structure of the landscape which 
is, roughly, the correlation between fitnesses at two points 
on' the landscape as a function of their distance. 

These properties of discrete landscapes, where the 
spins take on only discrete values, a,b,c,d... can be 
10 generalized to the case of continuous dimensions, where each 
variable is a real number. This continuous case, the lengths 
of walks uphill, and dwindling directions uphill must be 
parameterized by a -step length- in the space of reasonably 
smooth hill aides, any point on the landscape that is on a 
, s hillside has the property that, for infinitesimal steps away 
" from that point, half the directions are uphill and half are 
downhill. Only on ridges, saddles and peaks is that false. 
However, if a discrete step length, say 100 yards, is 
specified, then as a walk continues uphill and a ridge or 
20 saddle or oeak is approached, the -cone" that is still uphill 
will dwindle. The rate of dwindling is a measure that can be 
used to characterize the ruggedness of a continuous 
landscape. Thus, on NK landscapes, with K modestly large, 
the generic feature is that at every step uphill, the number 
„ of directions uphill falls by a constant fraction. As 
landscape ruggedness increases, the fraction by which turn 
directions uphill dwindles increases from a few percent to 
50% for fully random landscapes in the K - N 1 "random 
energy" limit. In a similar way, the rate at which the 
30 uphill cone decreases as walks uphill continue provides a 
measure of landscape ruggedness for continuous landscapes. 

Consider a oroduct space, without loss of 
generality taken to be soap. Features of this product were 
noted above, and in general, include other features of 
« interest. Consider, to be concrete and without loss of 

generality, discrete choice methods. A customer is presented 
with different choices of a bundle of properties, or vector 
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of properties. Each bundle is a point in the property space, 
A price is attached to each such point. The customer is 
asked to choose which, if any, he/sh would buy. Examination 
of the vector is the property space after a finite number of 

5 such choices, reveals a price in the vicinity of those 

positions in property space at which the customer will just 
stop buying. Thus, on one side of a point on a surface in 
property space, at that price, the customer will not buy, on 
the other side of a point on a surface in property space, at 

10 that price, the customer will not buy, on the other side he 
will buy. The point in question estimates the price for that 
specific vector of properties. By sampling at many points 
for one customer, it is possible to build up a set of points 
that escinates the utility curve, or surface, in property 

15 space at one price for that customer, hence an indifference 
surface, and a set of such surfaces at different prices. 

For a population of customers, a population of such 
data points can be assembled. In principle, much data could 
be obtained from each customer, but typically it is only 

2o feasible to obtain a limited amount of data from a given 
customer. Typically, this is obtained over a moderate large 
region of property space. The data points are then typically 
each labeled by a vector of demographic traits, and an 
attempt ie made using standard analysis to discriminate both 

25 high utility positions in the space of properties, and 

simultaneously the targeted demographic populations that are 
well matched to good positions in the space of properties in 
order to optimize the vector of goods produced, each at a 
different position in the property space, and targeted to one 
2Q or more positions in the demographic space, such that a total 
figure of merit such as total profit after total manufacture 
and sales. 

The application of landscape ideas can improve 
these standards procedures both by directing the limited 
35 sampling that can be done that it helps capture higher order 
terms, or context dependent features, of these marketscapes, 
helps build statistical models of the right "equivalence 
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class" of the real market scape, and helps build actual 
models of the actual marketscape. 

FIG. 2 provides a flow diagram of a method for 
determining consumer demand 200 that finds the context 

5 dependent, or combinatorial optimized set of properties, 
uses, or customer features that optimize the value of a 
product to the customer base. In step 2 02, the method for 
determining consumer demand 200 selects a point in property 
space that lies on a surface that divides a region where a 

10 predetermined customer would buy from a region where the 
predetermined customer would not buy. 

In step 204, the method for predicting consumer 
demand 200 samples a sew of points on an R-dimensional sphere 
surrounding the point selected in step 202. Step 204 

15 contrasts with previous methods for predicting consumer 

demand that sample widely over the product space. The radius 
of the sphere is defined in a well specified way where the 
radius is defined as the "step length" on the surface. An 
exemplary distance is the Euclidian distance. With the same 

20 customer, or more generally, the same class of customers, 
step 204 characterizes for many points in the spherical 
surface surrounding the point whose price has been 
determined, whether that new point would or would not be 
purchased by the customer at the given price. Since the true 

25 price surface in the space of properties contains the first 
determined point, that price surface will, in general, pierce 
the spherical surface surrounding the point whose price is 
determined. The points on the sphere which are purchased and 
the points which are not purchased determine, in the simplest 

30 case, a curve of points whose price is the transition between 
buying and not buying at the price. In this way, the 
neighborhood surrounding that first priced point can be 
examined . 

In step 206, the method for predicting consumer 
35 demand 200 determines whether the indifference surface has 
been substantially completed. If the method for predicting 
consumer demand 200 determines that the indifference surface 
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has not been substantially completed, control proceeds to 
step 208. In step 208, the method for predicting consumer 
demand selects another point on the indifference surface from 
the transition curve determined in step 204. After step 208, 

5 control returns to step 204. Step 204 samples a set of 
points on an R-dimensional sphere surrounding the point 
selected in step 208. In this fashion, the method for 
predicting consumer demand 200 operates to extend the 
indifference surface at the predetermined price in any 

10 direction in the property space. 

The ruggedness of the indifference surface at a 
given price will show up by any of the properties we have 
discussed. Thus, measured in property space, the 
indifference surface at a given price may have one or more 

15 correlation lengths in the space of properties. These 
correlation lengths, in the NK model are long, for K small, 
and short for K large. Thus, short correlation lengths 
estimate higher order couplings among the properties. The 
cone "uphill" in property space on an indifference surface at 

2o a given price can be determined. Good combinations of 
properties will show up as peaks or minima, depending upon 
direction of definition, in the surface. That is, a very 
good combination of properties in property space will show 
up, for example, as a willingness to pay the fixed price for 

2 5 a small "amount" of the given vector of properties. Having 
defined a local "peak" in the indifference landscape surface, 
we can define the typical walk length, given step size, the 
peak, and the number of peaks to which one can walk from any 
point, in addition, we can examine the similarity of peaks 

30 "climbed from the same or nearby points on the indifference 
landscape at a given price. We can ask if high peaks cluster 
near one another. We can ask whether recombination is a good 
means to find the high peaks. If so, we can search out the 
high peaks by focusing our questioning in precise ways, to 

35 look "between" the high peaks on the current landscape, and 
hill climb from those points to still higher peaks. 
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All these properties allow focused sampling of the 
landscape to estimate the higher order context dependent, 
combinatorial features of a given market scape. 

Scatisrical models of the sampled market scape can 

5 be built by utilizing P ©pin- like models, where che class of 
models with all possible values of the coefficients of all 
the Padic terms in the polynomials constitute the family of 
landscape models. Maximum entropy Bayesian updating similar 
techniques can then be used to estimate the most likely 

10 landscape parameters to fit the observed data. A major 

difference between the current approach and usual approaches 
is that the detailed sampling in specific regions of the 
indifference surface at a given price yields estimates of the 
how "high" the higher order terms, (K in the Nk model) 

15 actually are. Thus, we can estimate from such focused local 
measurements at several points on the landscape, that, for 
example, fifth order interactions, P=5, are critical for 
determining the local structure of the marketscape. Knowing 
that, we can use a preponderance of the data to fit or 

20 estimate the 5 5h order terms, and only a small amount of the 
data to estimate the monomial terms that may determine the 
overall non- isotropic features of the marketscape on long 
length scales across the marketscape. Thus, we can optimize 
use of the sampled data to discover both long range features 

25 of the landscape and local features. 

Given this analysis, one can derive a class of 
statistical models of the landscape, and specific models of 
the landscape. 

The method for predicting consumer demand 200 was 

30 explained in the context of computing an indifference surface 
for a predetermined price in the property space for a 
predetermined customer. However, as is known by one of 
ordinary skill in the art, the method for predicting consumer 
demand 200 could also be used to sample the property space of 

35 the product for a given class of customer at a predetermined 
price or at a set of predetermined prices. Further, the 
method for predicting consumer demand 200 could also be used 
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to arrange the demographically characterized population of 
customers into a customer-scape for any given point in the 
product space. This n w approach to market segmentation 
arises by casting the agents into an M dimensional 

5 demographic space. At any given price, we can determine the 
fraction of customers in any small volume of demographic 
space who will buy the good at that point in product space at 
the given price. This determines a "customer-scape" for that 
good at that price. Once again, the customer- scape is a 

10 landscape, and we can define all the properties noticed 
above: correlation structure, lengths of fixed step length 
walks to peaks, the dwindling cone uphill as peaks are 
climbed, the number of peaks accessible from a given point, 
the similarity of such peaks, and whether high peaks cluster 

13 near one another. In the latter case, recombination is a 
good means to search the landscape. This procedure defines 
one or more optimal customer features for a given good, or 
position in product space. The same procedure allows 
multiple points in product space to be utilized, indeed just 

20 the points normally utilized, to find the best set of 
positions in product space to match the best targeted 
populations of customers in customer space. Again, the 
advantage of our procedure is that it allows the higher order 
terms, the context dependent features in customer space, to 
25 be more readily detected, for it tells us that K order terms 
are important. Again, we can then construct statistical 
v models of customer-scapes, and models of specific customer 
scapes . 

The present invention further includes a framework 
30 for the marketing and introduction of novel products, which 
is a central function of businesses. PIG. 3 provides a flow 
diagram of the framework 300 for the marketing and 
introduction of novel products. The framework 300 concerns 
means to model customers and derive an optimal set of goods 
35 to produce alone or in the face of a coevolving competitive 
3 environment where other firms are introducing and modifying 
their own gcods. 
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In step 302, the framework for the marketing and 
introduction of novel products 300 assembles data on 
customers from statements of preferences on questionnaires, 
point of purchase data, neileon data, etc. In step 304, the 
5 framework for the marketing and introduction of novel 
produces 300 creates a model of customer preferences. In 
step 306, the framework 300 uses the models of customer 
preferences created in step 304 to identify preferred goods 
and services. In step 308, the framework considers the 
10 behavior of other firms in the environment in addition to the 
models of customer preferences created in step 304 to 
identify preferred goods and services in a coevolving 
competitive environment. 

PIG. 4 provides a flow diagram of the method for 
15 creating a model of customer preferences of step 304. In 
step 402 , the. method for creating a model of customer 
preferences 304 determines whether to perform market 
segmentation. If step 402 indicates that market segmentation 
should be performed, control proceeds to step 404 where the 
20 method for creating a model of customer preferences executes 
the adaptive dissimilarity partitioning method 100 shown by 
the flow diagram of FIG. 1. If step 402 indicates that 
market segmentation should not be performed, control proceeds 
to step 406. 

25 In step 406, the method for creating a model of 

customer preferences 304 constructs a family of linear or 
non-linear models of customers. These models are candidate 
maps from answers to questions, point of purchase data, etc. 
to the actual predictive preferences of the customers for the 

30 goods in question. Accordingly, an aim of the method for 
creating a model of customer preferences 304 is to order the 
goods in a match to actual preferences of customers. 

in step 408, the method for creating a model of 
customer preferences 304 constructs agent based models of 

35 customers based on default hierarchies, rules of thumb, etc. 
in their strategy space. Default hierarchies, etc. do not 
require that preferences be transitive, which is often true 
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of customers. In contrast, a preference space does requir 
transitivity. Agent based models of customers are described 
in A System and Method for the Synthesis of an Economic Web 
and the Identification of New Market Niches, Attorney docket 
5 number 9392-0007-999, filed May 15, 1998, the contents of 
which are herein incorporated by reference. Agent based 
models of customers are further described in An Adaptive and 
Reliable System and Method for Operations Management, 
Attorney docket number 9392-0004-999, filed July 1, 1999, the 
, 0 contents of which are herein incorporated by reference. 

In step 410, the method for creating a model of 
customer preferences 304 utilizes adaptive algorithms over 
the space of mappings produced by step 406 and the space of 
agent strategies produced by step 408 to find a set of models 
15 that predicts customer purchasing preferences for a set of 
goods. In the preferred embodiment, the adaptive algorithms 
are genetic algorithms. In an alternate embodiment, the 
adaptive algorithms are genetic programming. 

In step 412, the method for creating a model of 
20 customer preferences 304 determines whether the output of 
step 410 has produced good predictive models of customer 
purchasing preferences. If step 412 determines that the 
output of step 410 has not produced good predictive models of 
•customer purchasing preferences, control returns to step 406 
25 where processing proceeds with the new set of models produced 
by the adaptive algorithm of step 410. If step 412 
determines that the output of step 410 has produced good 
predictive models of customer purchasing preferences, control 
proceeds to step 414 where the processing terminates. 
3Q as previously discussed, in step 306, the framework 

300 uses the models of customer preferences created in step 
304 to identify preferred goods and services. If the 
customers have preferences for may features of a product that 
add up to a single preference landscape, then step 306 
35 executes the method for predicting consumer demand 200 

illustrated by the flow diagram of PIG. 2. In contrast, if 
the customer preferences are not commensurable, then step 306 
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executes an optimization tool to find the global pareto 
optimal points such as Configuration Sherpa, which is 
described in A System and Method for Coordinating Economic 
Activities Within and Between Economic Agents. In either 
5 case, one of ordinary skill in the art would understand that 
there are a variety of clustering and multi -dimensional 
scaling algorithms that can seek optimal choices of locations 
of goods in the product space to attract the most customers. 
Such algorithms may prespecify the number of goods, or seek 
10 optimal numbers and locations of goods based on a firm's 
budget constraints, and other aspects of firm operations in 
its competitive environment. 

As previously explained, in step 308, the framework 
considers the behavior of other firms in the environment in 
15 addition to the models of customer preferences created in 
step 304 to identify preferred goods and services in a 
coevolving competitive environment. Firms compete by 
introducing or improving products. Hence, there is a 
coevolutionary dynamic. Generically, there are two regimes: 
20 a "red queen* regime of persistent coevolution in the space 
of products and an evolutionary stable strategies regime 
where all products reach local or global Nash equilibria and 
stop moving in product space. See At Home in the Universe. 
If the firm completes the observe, orient, decide and act 
25 loop (OODA) faster than the other firms with respect to the 
introduction, innovation, improvement and wise placement of 
products, it can systematically win. 

Step 308 of the framework for the marketing and 
introduction of novel products 300 uses models of customers 
30 and capacity to predict preferences over the space of 

products to build agent based or other dynamical models of 
the coevolution of market shares of products, utilizing data 
to locate optimal positions for new or improved products in 
coevolutionary dynamics subject to constraints on budget, 
35 capacity, and time to market for new or improved goods, etc. 
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Agent based models that identify new products are described 
in A System and Method for the Synthesis of an Economic Web 
and the Identification of New Market Niches. 

FXQ. 5 discloses a representative computer system 

5 510 in conjunction with which the embodiments of the present 
invention may be implemented. Computer system 510 may be a 
personal computer, workstation, or a larger system such as a 
minicomputer. However, one skilled in the art of computer 
systems will understand that the present invention is not 

10 limited to a particular class or model of computer. 

As shown in FIG. 5, representative computer system 
510 includes a central processing unit (CPU) 512, a memory 
unit 514, one or more storage devices 516 # an input device 
516, an output device 520, and communication interface 2922. 

15 A system bus 524 is provided for communications between these 
elements. Computer system 510 may additionally function 
through use of an operating system such as Windows, DOS, or 
UNIX. However, one skilled in the art of computer systems 
will understand that the present invention is not limited to 

20 a particular configuration or operating system. 

Storage devices 516 may illustratively include one or 
more floppy or hard disk drives, CD-ROMs, DVDs, or tapes. 
Input device 518 comprises a keyboard, mouse, microphone, or 
other similar device. Output device 520 is a computer 

25 monitor or any other known computer output device. 
Communication interface 522 may be a modem, a network 
interface, or ether connection to external electronic 
devices, such as a serial or parallel port 

While the above invention has been described with 

30 reference to certain preferred embodiments, the scope of the 
present invention is not limited to these embodiments. One 
skill in the art may find variations of these preferred 
embodiments which, nevertheless, fall within the spirit of 
the present invention, whose scope is defined by the claims 

35 set forth below. 
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Claims 

1. A method for partitioning a space of data 
5 comprising the steps of: 

choosing a plurality of dissimilarity measures; 

partitioning the space for each of said plurality 
of dissimilarity measures; 

evaluating said partitioning for each of said 
10 plurality of dissimilarity measures; and 

selecting one or more of said dissimilarity 
measures on the basis of said evaluation. 

2. A method for partitioning a space as in claim 
15 l further comprising the steps of: 

performing at least one operation of said selected 
dissimilarity measures to generate a new plurality of 
dissimilarity measures; and 

repeating said partitioning the space step and said 
20 evaluating said partitioning step for each of said new 
plurality of dissimilarity measures; and 

selecting one or more of said new dissimilarity 
measures on the basis of said evaluation. 

25 3. A method for partitioning a space as in claim 

2 further comprising the step of iterating on said performing 
at least one operation of said selected dissimilarity 
measures step, said repeating said partitioning the space 
step snd said selecting one or more of said new dissimilarity 

30 measures step to achieve an optimal partition. 

4, A method for partitioning a space as in claim 
2 wherein said at least one operation is a genetic operation. 

35 5. A method for partitioning a space as in claim 

4 wherein said genetic operation is selected from the group 
consisting of a mutation and a crossover. 
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6. A method for partitioning a space as in claim 
1 wherein said choosing a plurality of dissimilarity measures 
st p comprises the steps of; 

choosing a family of dissimilarity measures; and 
5 randomly generating said plurality of dissimilarity 

measures from said chosen family. 

7. A method for partitioning a space as in claim 
1 wherein said partitioning for each of said plurality of 

10 dissimilarity measures step is performed by at least one 
clustering algorithm. 

8. A method for partitioning a space as in claim 
7 wherein said at least one clustering algorithm is a It-means 

15 clustering algorithm. 

9. A method for partitioning a space as in claim 
6 wherein said dissimilarity measure is a dissimilarity 
function and said family of dissimilarity measures is a 

20 family of dissimilarity functions. 

10. A method for partitioning a space as in claim 
9 wherein said family of dissimilarity functions for said 
space having two dimensions is: 



25 



» 



wherein 

x. w sea. *u»« and are said data in said space; 
30 f" and f, are polynomial functions of their variables; and 
f[ and f, are characterized by a plurality of parameters. 

11. A method for partitioning a space as in claim 
10 wherein said polynomial functions are second degree 
35 polynomial functions. 
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12. A method for partitioning a space as in claim 
10 wherein said plurality of parameters are coefficients of 
the polynomials. 

13. A method for partitioning a space as in claim. 
5 12 wherein the variations of said plurality of parameters are 

restricted to [-10,10]. 

14. A method for partitioning a space as in claim 
10 12 wherein said evaluating said partitioning for each of said 

plurality of dissimilarity measures step comprises the step 
of assigning a fitness to said each of said plurality of 
dissimilarity measures and said fitness is defined by: 

15 c-v 1 

wherein: 

M, is the number of said data that are assigned to the same 
20 partition that do not belong in the same partition; and 

M oue ' is che number of said data that are assigned to different 
' partitions and do belongin the same partition. 

15. A method for partitioning a space as in claim 
25 6 wherein said family of dissimilarity measures are general 

function approximators. 

16. A method for partitioning a space as in claim 
6 wherein said general function approximators are neural 

30 networks having connections weights. 

17. a method for partitioning a space as in claim 
1 wherein said evaluating said partitioning for each of said 
plurality of dissimilarity measures step is performed by a 

35 human observer. 
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18 a method for partitioning a space as in claim 
17 wherein said selecting one or more of said dissimilarity 
m easures on the basis of said evaluation step is performed by 
a human observer based on said evaluation. 

5 is. computer executable software code stored on a 

computer readable medium, the code for partitioning a space 
of data, the code comprising: 

code to choosing a plurality of dissimilarity 

l0 measures; u £ 

code to partition the space for each -of said 

plurality o£ dissimilarity measures, 

code to evaluate said partitioning for each of said 
plurality of dissimilarity measures; and 

code to select one or more of said dissimilarity 
" measures on the basis of said evaluation. 

20 computer executable software code stored on a 
computer readable medium, the code for partitioning a space 
, fl as in claim 19, the code further comprising: 

code to perform at least one operation of said 
selected dissimilarity measures to generate a new plurality 
of dissimilarity measures; 

code to repeat said partitioning the space step and 
25 said evaluating said partitioning step for each of said new 
plurality of dissimilarity measures, and 

code to select one or more of said new 
dissimilarity measures on the basis of said evaluation. 

21 A programmed computer system for partitioning 
30 a space comprising at least one memory having at least one 
region storing computer executable program code and at least 
one processor for executing the program code stored in said 
memory, wherein the program code includes, 

code to choosing a plurality of dissimilarity 

measures. 
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code to partition the' space for each of said 
plurality of dissimilarity measures; 

code to evaluate said partitioning for each of said 
plurality of dissimilarity measures; and 
5 code to select one or more of said dissimilarity 

measures on the basis of said evaluation. 

22. A programmed computer system for partitioning 
a space comprising at least one memory having at least one 

10 region storing computer executable program code and az least 
one processor for executing the program code stored in said 
memory as in claim 21, wherein the program code further 
includes : 

code to perform at least one operation of said 
15 selected dissimilarity measures to generate a new plurality 
of dissimilarity measures; 

code to repeat said partitioning the space step and 
said evaluating said partitioning step for each of said new 
plurality of dissimilarity measures; and 
20 code to select one or more of said new 

dissimilarity measures on the basis of said evaluation. 

23, A method for determining customer demand for 
products comprising the steps of: 

25 defining a space having R dimensions wherein each 

point in said space corresponds to a vector of properties; 

constructing a landscape for said space comprising 
the steps of: 

locating at least one point on said space 
30 where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price; and 

sampling a set of points on an R-dimensional 
sphere surrounding said selected point at a predetermined 
35 step length from said selected point to determine a first 
subset of said set of points where the predetermined customer 
would make a purchase at said predetermined price and to 
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!l tin at least one indifference surface between e 
. ZZ > ncn-buyin, re 9 ic„ at eaic — 

price . 

24 A method for determining customer demand for 
products as In claim 23 wherein said constructing a landscape 
10 for said space further comprises the steps of: 
10 selecting at least one point on said .nd.fference 

surface; and^^ ^ ^ ^ ^ 

point to extend said at least one indifference surface. 

15 25 A me thod for determining customer demand for 

products as in claim 24 further comprising the step of 
iterating on said selecting at least one point on said 
ndlf ference surface step and said repeating said sampling 
20 afeP fromsaid selected point step to further extend said 
indifference surface. 

26 A method for determining customer demand for 
^ .« aa in claim 23 further comprising the steps of 

„ZZZZ cnaraccerietic, .« eeia inference eurfac. fro, 
said sampling step. 

27 A method for determini.ng customer demand for 
products as I, claim 26 wherein said indifference surface 

30 characteristics comprise a degree of ruggedness. 

28 A method for determining customer demand for 
products as in claim 26 wherein said indifference surface 
products as in correlation length, 
characteristics comprise at least one c 

35 29 a method for determining customer demand for 

products as in claim 26 further comprising the step of 
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locating one or more points on said indifference surface 
having a small amount of said corresponding vector of 
properties to identify peaks on said indifference surface. 

30 A method for determining customer demand for 
5 products as In claim 29 wherein said indifference surface 

characteristics further comprise at least one typical wal* 
length to said identified peaks. 

31 A method for determining customer demand for 
10 products as In claim 29 wherein said indifference surface 

characteristics further comprise at least one clusterxnc 
measure of said identified peaks. 

32 A method for determining customer demand for 
15 products as in claim 29 further comprising the steps of, 

defining a family of possible models to 

reoresent the customer demand, and 

selecting one or more models from said family 
20 of possible models that are compatible with said indifference 
surface characteristics. 

33 A method for determining customer demand for 
products as In claim 32 wherein said selected models have a 

25 plurality of parameters. 

34 A method for determining customer demand for 
products as in claim 33 further comprising the step of 
determining values of said plurality of parameters for said 

30 selected models from said sampling step. 

35 a method for determining customer demand for 
oroducts aa in claim 33 wherein said values of said plurality 
'of selected parameters are determined using Bayesian 

35 analysis. 
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36. Computer executable software code stored on a 
computer readable medium, the code for determining customer 
demand for products, the code comprising: 

code to define a space having R dimensions wherein 
5 each point in said space corresponds to a vector of 
properties; 

code to construct a landscape for said space 
comprising! •> 

code to locate at least one point on said 
10 space where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 

predetermined price; and 

code to sample a set of points on an R- 
dimensional sphere surrounding said selected point at a 

15 predetermined step length from said selected point to 
"determine a first subset of said set of points where the 
oredetermined customer would make a purchase at said 
"predetermined price and to determine a second subset of said 
sampled points where the customers would not make the 

20 curchase ac said predetermined price, said first subset of 
"ooints and said second subset of points form at least one 
"indifference surface between a buying region and a non-buying 
region at said predetermined price. 

37 Computer executable software code scored on a 
" computer readable medium, the code for determining customer 
demand for products as in claim 36, wherein said code to 
construct a landscape for said space further comprises, 
code to select at least one point on said 

30 indifference surface; and 

code to repeat said sampling step from said 
selected point to extend said at least one indifference 
surface . 

38 Computer executable software code stored on a 
35 computer readable medium, the cede for determining customer 
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demand for products as in claim 37, the code further 
comprising: 

code to iterate on said selecting at least one 
point on said indifference surface step and said repeating 
s said sampling step from said selected point step to further 
extend said indifference surface. 

39 A programmed computer system for determining 
customer demand for products comprising at least one memory 
1fl having at least one region storing computer executable 
10 program code and at least one processor for executing the 
program code stored in said memory, wherein the program code 
includes: ^ ^ ^ space hav . ng r dimenaion8 wherein 

15 each point in said space corresponds to a vector of 
properties; 

code to construct a landscape for said space 

comprising: . . 

code to locate at least one point on said 

20 space where a predetermined customer would purchase a product 

having said corresponding vector of properties at a 

predetermined price; and 

code to sample a set of points on an R- 
• dimensional sphere surrounding said selected point at a 
25 predetermined step length from said selected point to 
" determine a first subset of said set of points where the 
predetermined customer would make a purchase at said 
predetermined price and to determine a second subset of 
sampled points where the customers would not ™* e <* e 
30 purchase at said predetermined price, said first 
30 points and said second subset of points form at lea one 
Indifference surface between a buying region and a non buying 
region at said predetermined price. 



d0 A programmed computer system for determining 
35 customer demand for products comprising at least one memory 
having at least one region storing computer executable 
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program code and at least one processor for xecuting the^ 
proU code stored in said memory as in claim 39, wherein 
sail code to construct a landscape for said space further 
comprises: 

code to select at least one point on said 

indifference surface; and 

code to repeat said sampling step .rom said 

<„- + n pvtend said at least one indifference 
selectea point to extena eai« 

surface . 

41 A programmed computer system for determining 
customer demand for products comprising at least one memory 
having at least one region storing computer encu^l* 
program code and at least one processor for executing the 
1S pr^-m code stored in said memory as in claim 40, wherein 
said code further comprises! 

code to iterate on said selecting at least one 
point on said indifference surface step and said 
Lid sampling step from said selected point step to further 
20 extend said indifference surface. 

42 A method for creating a model of consumer 
references from consumer data comprising the steps of; 
^ constructing a plurality of candidate maps form the 

25 consumer data to actual consumer preferences, 
" constructing a family of agent-based models, 

evaluating said plurality of candidate maps and 
■ said family of agent-based models with respect to said 

consumer data; 

selecting one or more of said plurality of 
30 . , A * am0v of aaent based models based on 

candidate maps and said family o£ agent t> 

said evaluation; and _«i»rt-ed 
rooming at 1«"= «• °P*»<=ion °- *" d " le0 " a 
candidate map* and «td ..l.«.d agent-based „=d«U to 
, 5 generate e new plurality of candidate -pa and a new family 
of agent -based models. 
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43. A method for creating a model of consumer 
preferences from consumer data as in claim 42 further 
comprising the step of iterating on said evaluating said 
plurality of candidate maps step, said selecting one or more 

5 of said plurality of candidate maps and said family of agent 
based models step and said performing at least one operation 
on said selected candidate maps and said selected agent-based 
models step to achieve an optimal model of consumer 
preferences. 

10 

44. A method for creating a model of consumer 
preferences from consumer data as in claim 42 wherein said at 
least one operation is a genetic operation. 

45. A method for creating a model of consumer 
preferences from consumer data as in claim 44 wherein said 
genetic operation is selected from the group consisting of a 
mutation and a crossover. 

2Q 4 6 . computer executable software code stored on a 

computer readable medium, the code for creating a model of 
consumer preferences from consumer data, the code comprising: 

code to construct a plurality of candidate- maps 
form the consumer data to actual consumer preferences ; 
25 code to construct a family of agent -based modelsj 

code to evaluate said plurality of candidate maps 
and said family of agent-based models with respect to said 

consumer dsta; 

code to select one or more of said plurality of 
30 candidate maps and said family of agent based models based on 
said evaluation; and 

code to perform at least one operation on said 
selected candidate maps and said selected agent-based models 
to generate a new plurality of candidate maps and a new 
35 family of agent -based models. 



- 39 - 



WO 00/02138 



PCTAJS99/15236 



47. A programmed computer system for creating a 
model of consumer preferences from consumer data, comprising 
at least one memory having at least one region storing 
computer executable program code and at least one processor 

5 for executing the program code stored in said memory, wherein 
the program code includes i 

code to construct a plurality of candidate maps 
from the consumer data to actual consumer preferences; 

code to construct a family of agent-based models; 
10 code to evaluate said plurality of candidate maps 

and said family of agent-based models with respect to said 
consumer data; 

code to select one or more of said plurality of 
candidate maps and said family of agent based models based on 
15 Baid evaluation; and 

code to perform at least: one operation on said 
selected candidate maps and said selected agent -based models 
co generate a new plurality of candidate maps and a new 
family of agent -based models. 

20 

48. A method for marketing and introducing novel 
products from consumer data comprising the steps of: 

creating a model of customer preferences; and v 
identifying novel products using the method for 
25 determining customer demand of claim 23, 

49. A method for marketing and introducing novel 
products from consumer data wherein said creating a model of 
customer preferences step is performing using the method of 

30 claim 1. 

50. Computer executable, software code stored on a 
computer readable medium, the code for marketing and 
introducing novel products from consumer data, the code 

35 comprising: 

code to create a model of customer preferences; and 
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code to identify novel products using the method 
for determining customer demand of claim 23. 

51. Computer executable software code stored on a 
computer readable medium, the code for marketing and 
introducing novel products from consumer data as in claim 50/ 
wherein said code to create a model of customer preferences 
is the code of claim 19. 
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